# Low-Latency Speech Processing
Ultravox V0 3
MIT
Ultravox is a multimodal speech large language model based on Llama3.1-8B-Instruct and Whisper-small, capable of processing both speech and text inputs.
Audio-to-Text
Transformers English

U
FriendliAI
20
1
Ultravox V0 3
MIT
Ultravox is a multimodal speech large language model built upon Llama3.1-8B-Instruct and Whisper-small, capable of processing both speech and text inputs.
Text-to-Audio
Transformers English

U
fixie-ai
48.30k
17
Featured Recommended AI Models